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METHOD OF RESEARCHING AND ANALYZING INFORMATION 
CONTAINED IN A DATABASE 



5 Field of the Invention 

The present invention relates to a method of researching and analyzing information 
contained in databases and more particularly to information contained in electronic 
databases accessible over the Internet. 

1 0 Background of the Invention 

A large number of databases are available in the public domain that are accessible over 
the Internet and contain a plethora of information. A database is defined as a collection 
of data organized especially for rapid search and retrieval as by a computer. The data 
may be text documents and/or images or numbers. 

15 

One example of a database is the patent database displayed in the United States Patent 
and Trademark Office (USPTO) web page. A provider, in this case the USPTO 
organization, provides and displays documents (i.e., patents) in the database and defines a 
method of searching the database using standard patent fields, i.e., patent number, 
20 inventor, assignee, issue date and title, among others. A user may also search the 
database by entering a search query consisting of specific keywords encoded in a 
Boolean formalism. The result of the database search is a list of documents that contain 
the standard patent fields or the keywords the user requested. 

25 However, in many cases a user searches for specific type of information which although 
it may be contained in the database documents it cannot be directly accessed using the 
conventional search methods because the specific search query was not envisioned by the 
provider or was not intended to be searched. These specific types of search queries 
require advanced search methods and are used in research applications. 

30 
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One such advanced search method is described in U.S. Patent 6,038,561 , where a 
dynamic concept (or "natural language") query is performed. A user enters a list of 
words ranging from a single keyword to an entire document in a user-specified query 
document. This user-specified query document is then compared for similarity to a set of 
5 documents contained in the database and measures of similarity scores are obtained. 
These measures of similarity scores provide answers regarding patent infringement 
between two patents, or synergy between companies and inventories, among others. 

However, in general research applications, a user seeks answers to a "new set of 
10 questions" and is not looking to develop a similarity analysis between two documents but 
rather to develop "a thesis about a new subject matter". The "new subject matter" may 
be an assessment of the technical capabilities of a given company, a business strategy, a 
marketing analysis, type of material or human resources required to set-up a specific 
operation or to develop a specific type of technology. This type of research is usually 
15 performed manually in a non-systematic way. It is also cumbersome and takes a long 
time. 

There is a need for an advanced method of researching electronically information in 
existing databases in order to develop via analysis and/or synthesis a new type of 
20 information database and ultimately "a thesis about a new subject matter". 

Summary of the Invention 

In general, in one aspect, the invention provides a method of researching and analyzing 
25 information contained in a plurality of documents. The documents belong to a first 
database and are organized according to a first set of fields for an electronic search and 
retrieval by a computer. The method includes first conducting an electronic search of the 
first database to retrieve at least one document, then developing a second set of fields and 
then reading the at least one document to retrieve information pertaining to the user 
30 defined fields. The next step is entering into a second database the at least one document, 
the values of the first set of fields for the at least one document, the second set of fields 
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and the retrieved information pertaining to the second set of fields and finally analyzing 
the information that is contained in the second database. 

Implementations of the invention may include one or more of the following features. The 

5 first database may be searched based on the first set of fields, keywords, a Boolean 
phrase and on a "natural language" query. The first database may be a patent database, 
the documents may be patents and the first set of fields may be patent fields. The patent 
database may be the United States Patent and Trademark Office patent database, the 
European Patent Office patent database, the Japanese Patent Office patent database and 

10 an international patent database. The patent fields may be selected from a group 
consisting of Patent Number, Title, Assignee Name and Location, Filing date, Date of 
patent, Application Number, Inventor(s) Name, U.S. Class, U.S. Subclass, International 
Class, International Subclass, Field of Search, Number of references cited, Number of 
claims, Number of independent claims, Abstract, Name of Primary Examiner, Name of 

15 secondary Examiner or Name of attorney, agent, firm. The documents may be 
publications, books, newspapers, and magazines. The second set of fields may be user- 
defined fields and may be developed from answers to questions contained in a first input 
form. The user-defined field may include a core competency of an invention, a 
technology of an invention, an element of an invention, a business driver for an 

20 invention, a product of an invention, a field of an invention, a problem solved by an 
invention, an intellectual property strategy for an invention, a priority of an invention, a 
capability for producing an invention, a level of novelty of an invention, a market size for 
an invention, a maturity level of an invention, a level of importance of an invention, a 
patent strategy of an invention, a business strategy of an invention, ability to detect use of 

25 an invention, an estimate of use of an invention by competitors, an estimate of use of an 
invention by suppliers, an estimate of use of an invention by customers, an estimate of 
alliance potential generated by an invention, an estimate of technology transfer 
potential generated by an invention, a level of prestige generated by an invention, an 
estimate of licensing potential of an invention and an estimate of market potential 

30 generated of an invention. The second database may be an Access™ type database. 
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The step of analyzing the information contained in the second database includes entering 
the information into a spreadsheet and displaying the user-defined fields in graphs and 
tables. Each user-defined field may be associated with a value and the analyzing includes 
summation of the values for each user-defined field. More than one document may be 
5 retrieved. The method may also include developing a high level of abstraction view of 
the retrieved documents. 

In general, in another aspect, the invention features an apparatus for researching and 
analyzing information contained in a plurality of documents including a computer system 

10 having a display and a central processing unit (CPU). The computer system accesses an 
Internet website. The Internet website has a first database that contains a plurality of 
documents organized according to first set of fields for an electronic search and retrieval 
by the computer system. The computer system also includes one or more computer 
instructions for conducting an electronic search of the first database to retrieve at least 

15 one document, one or more computer instructions for developing a second set of fields, 
one or more computer instructions for entering into a second database the at least one 
document, values of the first set of fields for the at least one document, the second set of 
fields and information pertaining to the second set of fields extracted via reading from the 
at least one document and one or more computer instructions for analyzing the 

20 information contained in the second database. The second database may be contained 
within the computer system. 

Among the advantages of this invention may be one or more of the following. The 
invention provides a method of analyzing patents and extracting information pertaining to 
25 market, business, IP strategy, business strategy and technology for one or more 
inventions. 

The details of one or more embodiments of the invention are set forth in the 
accompanying drawings and description below. Other features, objects and advantages of 
30 the invention will be apparent from the following description of the preferred 
embodiments, the drawings and from the claims. 
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Brief Description of the Drawings 

FIG. 1 is a block diagram of a patent mapping process according to this invention. 

5 

FIG. 2 is a block diagram of a first input form. 
FIG. 3 is a block diagram of a second input form. 
1 0 FIG. 4 is a worksheet input file for the core competencies field. 
FIG. 5 is a worksheet input file for the technologies field. 
FIG. 6 is a worksheet input file for the elements field. 

15 

FIG. 7 is a worksheet input file for the business driver's field. 
FIG. 8 is a worksheet input file for the product categories field. 
20 FIG. 9 is a worksheet input file for the fields of invention field. 
FIG. 10 is a worksheet input file for the problems field. 
FIG. 1 1 is a worksheet input file for the IP strategy field. 

25 

FIG. 12 is a worksheet input file for the priorities field. 
FIG. 13 is a worksheet input file for the invention view field. 
30 FIG. 14 is a worksheet input file for the applied invention matrix (AIM) class field. 
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FIG. 1 5 is a worksheet input file for the market size field. 
FIG. 1 6 is a worksheet input file for the maturity of technologies field. 
5 FIG. 17 is a worksheet input file for the importance of products field. 
FIG. 1 8 is a worksheet input file for the patent strategy field. 
FIG. 19 is a worksheet input file for the business strategy field. 

10 

FIG. 20 is a worksheet input file for the detectability field. 
FIG. 21 is a worksheet input file for the competitors use field. 
1 5 FIG. 22 is a worksheet input file for the suppliers use field. 
FIG. 23 is a worksheet input file for the customers use field. 
FIG. 24 is a worksheet input file for the alliance potential field. 

20 

FIG. 25 is a worksheet input file for the technology transfer field. 
FIG. 26 is a worksheet input file for the prestige field. 
25 FIG. 27 is a block diagram of a third input form. 

FIG. 28 is a block diagram for the high level of abstraction (HLA) Framework. 
FIG. 29 is a worksheet input file for the HLA framework. 

30 

FIG. 30 illustrates a representative screen shot for a patent record in the database 510. 
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FIG. 31 is a bar graph depicting the IP space for electronic equipment occupied by four 
competing companies A, B, C and D. 

FIG. 32 is a bar graph depicting the technology space occupied by the four competing 
5 companies A, B, C and D of FIG. 3 1 . 

FIG. 33 is a bar graph depicting the business drivers for the four competing companies A, 
B, C and D of FIG. 31. 

10 FIG. 34 is a bar graph depicting the core competencies for the four competing companies 
A, B,CandDofFIG. 31. 

FIG. 35 is a bar graph depicting the year the patents belonging to the four competing 
companies A, B, C and D of FIG. 31 were issued. 

15 

FIG. 36 is a Venn diagram depicting a first example of the IP and product space occupied 
by companies A and B. 

FIG. 37 is a Venn diagram depicting a second example of the IP and product space 
20 occupied by companies A and B. 

FIG. 38 is a Venn diagram depicting a third example of the IP and product space 
occupied by companies A and B. 

25 FIG. 39 is an apparatus for implementing the mapping process of this invention. 

Like reference numbers and designations in the various drawings indicate like elements. 
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Detailed Description of the Preferred Embodiment 

Referring to FIG. 1, a patent mapping process 100 includes the following steps. First a 
user fills out input forms 302, 304 and 306. Input forms 302, 304 and 306 are used to 
identify the user's areas of interests including product, technology, process, business, and 

5 market, among others. The information captured in input form 302 is used to define 
fields for an electronic search query 202 and to construct a filter 230. The information 
captured in input forms 304 and 306 is used to construct a High Level of Abstraction 
(HLA) framework 401 and user-defined fields 300. These user-defined fields 300 
represent the specific interests of the user or the specific questions a user would like to 

10 answer. 

Referring to FIG. 2, in first input form 302 the user is asked to list all disclosures 520, 
patents 521 and technical papers 522 known to the user in the areas of interest, define the 
technology areas, scopes and boundaries for the mapping process 523 and list all major 
15 competitors 524, suppliers 525 and customers 526 in the areas of interest. 

Referring to FIG. 3, in second input form 304, the user is asked to answer questions in 
specific areas including core competencies 530, technologies 531, elements 532, business 
drivers 533, products 534, fields of the invention 535, problems solved 536, intellectual 

20 property (IP) strategy 537, priorities 538, invention view capabilities 539, applied 
invention matrix (AIM), i.e., level of novelty 540, market size 541, maturity of 
technology 542, importance of products 543, patent strategy 544, business strategy 545, 
detectability of the invention 546, use of the invention by competitors 547, suppliers 548, 
and customers 549, alliance potential 550, technology transfer potential 551 and prestige 

25 552. For each specific area a spreadsheet type input file is filled out that includes a list of 
all relevant parameters, their definition and a measure of their importance for the 
development of the specified product or technology. The information captured in input 
form 304 is user specific and varies from user to user. This information is used to 
transform user-specific conditions, constrains and measures into user-defined fields 300 

30 and HLA framework 401 , shown in FIG. 1 . 
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In the question about core competencies, the user is asked to describe the technical 
expertise and skills that are needed to produce the inventions of interest. Examples of 
core competencies include mechanical engineer, software engineer, economist, database 
analyst, market analyst and marketing. Referring to FIG. 4, the core competencies input 

5 file for the development of a drug product includes a first column that lists all the type of 
special expert skills that are needed to produce the drug, a second column that describes 
each skill and a third column with a weight factor having values ranging from 1 to 10 
indicating the weight of importance for each skill. In the example of FIG. 4, the expert 
skills needed to produce the drug are organic chemist, chemical engineer, biochemist, 

10 toxicologist, physical chemist, analytical chemist and a medical doctor. 

In the question about technologies, the user is asked to describe the technologies utilized 
in developing the inventions of interest. Examples of technologies include robotics, 
semiconductor processes, quality control and quantitative analysis. Referring to FIG. 5, 
15 the input file for the technologies includes a first column that lists all the technologies, a 
second column that describes each technology and a third column with weight factors 
having values from 1 to 10 indicating the weight of importance for each technology. 
Among the technologies needed to produce the drug of FIG. 4 are organic chemistry, 
analytical chemistry, animal studies and manufacturing processes. 

20 

In the question about elements, the user is asked to describe the principal elements of the 
inventions. Examples of elements for an apparatus invention include all the actual 
elements of the invention. For a method type invention the elements include all the steps 
or processes of the invention. Referring to FIG. 6, the input file for the elements includes 
25 a first column that lists all the elements, a second column that describes each element and 
a third column with weight factors having values from 1 to 10 indicating the weight of 
importance for each element. Among the elements of the drug of FIG. 4 are its chemical 
formulation, user-friendly package and patient reaction. 

30 In the question about business drivers, the user is asked to describe the commercial 
advantages of the inventions of interest. Examples of business drivers include higher 
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yield, reduced cost, improved uniformity and reduced size. Referring to FIG. 7, the input 
file for the business drivers includes a first column that lists all the commercial 
advantages of the invention, a second column that defines each advantage and a third 
column with weight factors having values from 1 to 10 indicating the weight of 
5 importance for each advantage. Among the business drivers for the invention of the drug 
of FIG. 4 are its market response, user convenience and new treatment method. 

In the question about products and product categories, the user is asked to describe the 
products of the inventions of interest. Examples of products include electrodes, business 

10 forecasting, business plans. Referring to FIG. 8, the input file for the products includes a 
first column that lists all the products, methods and processes that the invention produces, 
a second column that describes each product and a third column with weight factors 
having values from 1 to 10 indicating the weight of importance for each product. 
Among the product categories of the drug of FIG. 4 are included estrogen, calcitonin, 

1 5 estrogen inhibitors, disease treating and preventing drugs. 

In the question about fields of invention, the user is asked to describe the specific fields 
of the inventions of interest. Examples of fields of invention include etching, 
temperature control, data processing, and data compression. Referring to FIG. 9, the 
20 input file for the fields of the invention includes a first column that lists all the specific 
fields of the invention, a second column that defines each field and a third column with 
weight factors having values from 1 to 10 indicating the weight of importance for each 
field. Among the fields of the invention for the drug of FIG. 4 are included synthetic 
chemistry, formulation and method of use of products. 

25 

In the question about problems, the user is asked to describe the major technical problems 
that the invention solves. Examples of problems include how to resist corrosion, 
compress data, and accelerate electrons. Referring to FIG. 10, the input file for the 
problems of the invention includes a first column that lists all the major problems that 
30 one would like to solve with the invention, a second column that describes each problem 
and a third column with weight factors having values from 1 to 10 indicating the weight 
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of importance for each problem. Among the problems of the invention for the drag of 
FIG. 4 are included how to improve of bioavailability, how to improve drug acceptance 
and how to increase drug efficacy. 



5 In the question about Intellectual Property (IP) strategy, the user is asked to rate the 
technologies or products of the inventions of interest according to the patent strategy of 
the user's company. The company patent strategy is used to plan product and technology 
development and to secure a competitive position in the market. If a company does not 
have a defined patent strategy this field is left empty and an initial map is completed 

10 without this information. The results of the initial map are used to develop a company 
patent strategy. This developed patent strategy is then used to update the results of the 
initial patent map. Referring to FIG. 11, the input file for the intellectual property 
strategy includes a first column that lists all the possible IP strategies for each technology 
or product of the invention, a second column that describes each product or technology, 

15 and a third column with weight factors having values from 1 to 10 indicating the weight 
of importance for each technology or product. In one example, the IP strategies include 
patenting around a competitor's patents, publishing in front of a competitor's patents, 
publishing around a competitor's patents, increasing understanding of a competitor's 
patents and belonging not in area of interest of a competitor's patents. 

20 

In the question about priorities, the user is asked to prioritize each technology and or 
product of the inventions. Referring to FIG. 12, the input file for the priorities of the 
technologies and products includes a first column that rates qualitatively the level of 
priority and a second column that lists each technology and/or product of the invention. 
25 In one example, the levels of priorities are rated as emergency, high, medium and low. 

In the question about invention view, the user is asked to describe the capabilities needed 
to produce the products of the inventions of interest. Referring to FIG. 13, the input file 
for the invention view includes a first column that lists the user's own capabilities, 
30 competitor's capabilities and supplier's capabilities and a second column that describes 
user's capabilities, competitors' capabilities and suppliers' capabilities, respectively. 
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In the question about applied invention matrix (AIM) class, the user is asked to 
categorize the technology fields of the inventions of interest by the level of novelty. The 
novelty level is characterized as "breakthrough" for a very broad fundamental technology 
5 or product, "distinctive" for a unique method, offbeat approach, or intermediate 
improvement and "incremental" for a very detailed incremental improvement. Referring 
to FIG. 14, the input file for the AIM Class of the invention includes a first column that 
rates the novelty level and a second column that describes each product or technology of 
the invention. 

10 

In the question about market size, the user is asked to estimate the market size range for 
the technology fields and products of the inventions of interest. Referring to FIG, 15, the 
input file for the market size includes a first column that rates the market size range and a 
second column that describes each product or technology of the invention. In one 
15 example, the market size ranges include 1 million to 10 million dollars, 10 million to 100 
million dollars, 100 million to 1 billion dollars, lbillion to 5 billion dollars and greater 
than 5 billion dollars. 

In the question about maturity of technology, the user is asked to rate the maturity of the 
20 technology and products of the inventions of interest. Referring to FIG. 16, the input file 
for the maturity of technologies includes a first column that rates the technology maturity 
level, a second column that describes each product or technology of the invention and a 
third column with weight factors having values from 1 to 10 indicating the weight of 
importance for each technology or product. In one example, the maturity level is 
25 characterized as "last generation", "current generation" and "emerging generation". 

In the question about importance of the invention to current or planned products the user 
is asked to rate the importance of the invention for current or planned company products. 
Referring to FIG. 15, the input file for the importance of the products includes a first 
30 column that rates the importance of a product, a second column that describes each 
product or technology of the invention and a third column with weight factors having 
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values from 1 to 10 indicating the weight of importance. In one example, the importance 
is characterized as "peripheral" for user or supplier related inventions, "element of a 
product" for a product manufactured by the user's company and is interchangeable with 
another type of product from the user's company or from another company, "essential to 
5 a product" for a product essential to the user's product line and "creates a product" if the 
invention can be used to make the entirety of a new product. 

In the question about patent strategy, the user is asked to rate the strategic importance of 
the invention to the overall company patent strategy. Referring to FIG. 18, the input file 

10 for the patent strategy includes a first column that rates the invention according to the 
company's patent strategy, a second column that describes each product or technology of 
the invention and a third column with weight factors having values from 1 to 10 
indicating the weight of importance. In one example, the different categories for the 
strategic importance of the invention to the overall company patent strategy include, "no 

15 patent strategy in the invention field", "high strategic importance", "medium strategic 
importance" and "low strategic importance". 

In the question about business strategy, the user is asked to rate the importance of the 
technologies or products of the invention to the overall company business strategy. 

20 Referring to FIG. 19, the input file for the business strategy includes a first column that 
rates the business strategy, a second column that describes each product or technology of 
the invention and a third column with weight factors having values from 1 to 10 
indicating the weight of importance. In one example, the different categories for the 
strategic importance of the invention to the overall company business strategy include, 

25 "no business strategy in the invention field", "high business importance", "medium 
business importance" and "low business importance". 

In the question about detectability, the user is asked to rate the technologies or products 
of the invention by the ability to detect their use by others. In general, it is easy to detect 
30 the use of an apparatus or a product of an invention by others whereas the use of methods 
and processes are often undetectable. Referring to FIG. 18, the input file for the 



13 



detectability includes a first column that rates the detectability, a second column that 
describes each product or technology of the invention and a third column with weight 
factors having values from 1 to 10 indicating the weight of detectability. In one example, 
the ability to detect the use of an invention by others is rated as "obvious", "easily 
5 detected", "detectable with work" and "undetectable". In the drug example of FIG. 4, the 
methods for coating, tabletting, packaging, use and combination with other products are 
rated as "obvious". The methods of chemical formulation and synthetic chemistry are 
rated as "detectable with work". 

10 In the question about competitive use, the user is asked to rate the technologies or 
products of the invention by the percentage of competitors who would use them. 
Referring to FIG. 21, the input file for the competitive use includes a first column that 
rates the competitive use and a second column that describes each product or technology 
of the invention. In one example, the ratings for the competitive use include "less than 

15 10% of competitors will use", " 10% to 50% of competitors will use", " most competitors 
will use", "unknown" and "will only be used by user's company". In the drug example of 
FIG. 4, it is estimated that most competitors will use the methods of tabletting, coating 
and packaging, whereas only 10 % to 50% percent of the competitors will use the 
methods of formulation, use, synthetic chemistry and combination of products. 

20 

In the question about suppliers' use, the user is asked to rate the technologies or products 
of the invention by the percentage of suppliers who would use them. Referring to FIG. 
22, the input file for the suppliers' use includes a first column that rates the suppliers use 
and a second column that describes each product or technology of the invention. In one 
25 example, the ratings for the suppliers' use include "less than 10% of suppliers will use", " 
10% to 50% of suppliers will use", " most suppliers will use", and "unknown". In the 
drug example of FIG. 4, it is estimated that most suppliers will use the synthetic 
chemistry method, whereas less than 10 % percent of the suppliers will use the methods 
of formulation, use, tabletting, coating, packaging methods and combination of products. 

30 
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In the question about customer use, the user is asked to rate the technologies or products 
of the invention by the percentage of customers who would use them. Referring to FIG. 
23, the input file for the customer use includes a first column that rates the customer use 
and a second column that describes each product or technology of the invention. In one 
5 example, the ratings for the customer use include "no customer will use", "less than 10% 
of customers will use", " 10% to 50% of customers will use", " most customers will use", 
and ''unknown". In the drug example of FIG. 4, it is estimated that no customer will use 
the methods for synthetic chemistry, chemical formulation, use, tabletting, coating, 
packaging methods and combination of products. 

10 

In the question about alliance potential, the user is asked to rate the technologies or 
products of the invention by the level of potential current or anticipated alliance activity 
that the invention will enable between the user's company and other companies. 
Referring to FIG. 24, the input file for the alliance potential includes a first column that 

15 rates the alliance potential and a second column that describes each product or 
technology of the invention. In one example, the alliance potential is rated as "low", 
"medium", "high" and "unknown". In the drug example of FIG. 4, the alliance potential 
for the method for tabletting is rated as "low", for the methods of packaging and 
formulation as "medium" and for the methods for synthetic chemistry, coating, use and 

20 combination of products as "high". 

In the question about technology transfer, the user is asked to rate the technologies or 
products of the invention by the level of current or anticipated technology transfer that 
the invention will enable between the user's company and other companies. Referring to 

25 FIG. 25, the input file for the technology transfer includes a first column that rates the 
technology transfer probability and a second column that describes each product or 
technology of the invention. In one example, the technology transfer potential is rated as 
"low", "medium", "high" and "unknown" In the drug example of FIG. 4, the technology 
transfer potential for the methods of tabletting and coating is rated as "low", for the 

30 methods of packaging and formulation as "medium" and for the methods for synthetic 
chemistry, use and combination of products as "high". 
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In the question about prestige, the user is asked to rate the technologies or products of the 
invention by the level of prestige or recognition that may bring to the inventors and the 
company. Referring to FIG. 26, the input file for the prestige includes a first column that 
5 rates the prestige level and a second column that describes each product or technology of 
the invention. In one example, the prestige level is rated as "low", "medium" and "high". 
In the drug example of FIG. 4, the prestige level for the methods of tabletting and 
formulation is rated as "low", for the methods of packaging, coating and synthetic 
chemistry as "medium" and for the methods of use and combination of products as 
10 "high". 

Referring back to FIG. 1, the information captured in third input form 306 is used to 
develop a High Level of Abstraction (HLA) framework 401 for a specific technology or 
product. Referring to FIG. 27, third input form 306 includes a first step 560, where the 
15 user is asked to separate the products and technologies into general categories, a second 
step 562, where the user is asked to separate the general categories into a first set of sub- 
categories, a third step 564, where a user is asked to separate each first sub-category into 
another level of sub-categories and a fourth step 566, where the user is asked to enter the 
results from the first, second and third steps into a spreadsheet. 

20 

Referring to FIGS. 28 and 29, the HLA framework 401 includes a main category 412 and 
a first and second subcategories, 414 and 416, respectively. In one example, the specific 
product is a computer 420 and the HLA main categories 412 are the processor 422, 
monitor 424, and the input devices 426. The first subcategories 414 of the processor 422 

25 main category are the motherboard 428, the graphics board 430 and the disk drives 432. 
The first subcategory of the monitor 424 main category is the screen 434. The first 
subcategories of the input devices 426 are the mouse 436 and the keyboard 438. The 
second subcategories 416 of the motherboard 428 first subcategory are the CPU 440, the 
cache 442, and the ports 444. The second subcategories 416 of the mouse 436 first 

30 subcategory are the buttons 446. The HLA framework is applied to the group of relevant 
patents 500 to form HLA clusters 402 of the collected patents. Each patent in the group 
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of relevant patents 500 is evaluated and placed in the appropriate HLA cluster 450. Both 
the formation of HLA clusters and the extraction of information to fill-out the user- 
defined fields occur during the reading and evaluation of the patents 310. 



5 Referring again to FIG. 1, after filling out input forms 302, 304 and 306 the user conducts 
an electronic search 202 of the USPTO patent database 200. The USPTO database 200 is 
indexed according to standard patent fields 210, which are listed in Table 1. The user can 
search the USPTO database 200 based on one or more of the standard patent fields 210. 
In one example, an electronic search is conducted for all patents issued to a specific 

10 assignee in a given specific time period. The first patent field is the assignee name and 
the second is the year. The result of such an electronic search is a list of patents 220 
which belong to the specified assignee and were issued on a specified. In addition to a 
patent field, the USPTO database can also be search via a "keyword". In this case one or 
more keywords are formulated into a Boolean phrase and the database is searched for 

15 patents that contain the defined Boolean phrase. A user can also search the USPTO 
database via a "natural language" query described in U.S. 6,038,561, incorporated herein 
by reference. 

Continuing to refer to FIG. 1, in the next step a filter 230 is applied to the list of patents 
20 220. Filter 230 is developed based on the information contained in the input form 302. 
The result of filtering is a list of more relevant patents 500. As was mentioned above, 
based on the information contained in the second input form 304 user-defined fields 300 
are developed. The user-defined fields 300 are used to evaluate the list of relevant 
patents 500. The evaluation 310 includes reading each patent and extracting information 
25 pertaining to the user defined fields 300. These user-defined fields 300 are not directly 
described in the specifications of the relevant patents 500. In some cases they are 
mentioned in the patents but in most cases they are inferred by the description of the 
invention. In one example, when a material is described to be processed by chemical 
vapor deposition (CVD) it can be inferred that the capabilities needed to produce this 
30 material include a CVD reactor, expertise and equipment in high vacuum, specialty gases 
and scrubbing systems for poisonous gases. To conduct this type of evaluation 310 a 
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knowledgeable individual reads and analyzes each patent in the group of relevant patents 
500 and extracts information pertaining to each user-defined field 300 either directly or 
via inference. 

5 In the next step a data value system (DVS) 502 receives the information pertaining to the 
specific patent field queries 250 5 the information pertaining to the specific user-defined 
field queries 350 and the clustered patents 402 and enters them into a DVS database 510. 
In one example the DVS database 510 is formulated as an Access™ type database 
provided by Microsoft Inc. An Access™ type database can accommodate a large number 

10 of documents and each document is allowed to have several fields. Information of 
predetermined types is entered into the fields and the information is accessed via a query 
based on the same predetermined fields. In addition to this type of field based query, the 
Access™ type database provides a query language so that a user can form relational 
inquires into the database. An example of a relational inquiry for database 510 includes a 

1 5 specific patent number "AND" a business driver field "FOR" a specific technology field. 

Referring to FIG. 30, a screen shot of an Access™ type database 510 includes for each 
patent belonging to group 500 an index number 610, the information for all the requested 
patent fields including, patent number 611, issue date 612, class 613, subclass 614, 

20 inventor names 615, assignee name 616, title 617, abstract 618, number of total claims 
619, number of independent claims 620, number of citations 621, number of citations 
originating not from the applicants 622, number of content terms in first claim 623, and 
number of content terms in exemplary claim (not shown). The database also includes the 
information for all the user-defined fields 300, such as, core competency 530, technology 

25 531, elements 532, business drive 533, product 534, field of invention 535, problems 
solved 536, IP strategy 537, priority 538, invention type 626, invention view 539 and 
novelty level (AIM) 540. Each patent is also assigned a cluster number 624. 

Referring to Table 3, the information contained in the Access ™ database 510 is used to 
30 construct an Excel™ type spreadsheet 650. Spreadsheet 650 has columns 660 including 
all patent fields 651, all user-defined fields 300 and all the values associated with the 
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user-defined fields. The values are summed for each user defined field and the sums are 
displayed on the spreadsheet (not shown) and on the screen shot 625, shown in FIG. 30. 
The Excel™ spreadsheet is used to construct Pivot™ Tables, Graphs and Patent Maps 504 
and to analyze the data 506, as shown in FIG. 1. Examples of analyses include a product 
5 analysis, a technology analysis, a business driver's analysis, a core competency analysis, 
a timing analysis, a silver bullet analysis, a claims analysis, a patent assessment tool 
analysis and a summary analysis. 

Referring to FIG. 31, in one example, the IP space for electronic equipment manufactured 
10 by four competing companies, A, B, C and D is segmented by specific product type. The 
largest product segment includes patents on low noise operational amplifiers and 
company C occupies the majority space in this segment. The smallest product segment 
includes patents on bandgap temperature sensors and companies A and C are the two 
major competitors sharing this segment space. Other segments include patents on active 
15 filter, balanced mixer circuit, bipolar transistors, digital-analog converters, gating lateral 
transistor structure, multi stage resistive ladder, operational amplifiers (other than low 
noise or fixed gain), operational amplifiers with fixed gain, optically coupled isolator 
circuits, switches, voltage reference circuits and voltage regulators. 

20 Referring to FIG. 32, in one example, the technology IP space of the same four major 
competitors A, B, C and D of FIG. 31 is segmented by specific processes and 
technologies. The segments include analog circuitry design, BiCMOS analog process, 
Bipolar analog processes, CMOS analog processes, IC Package, logic circuitry design 
and semiconductor processes. The largest technology segment includes patents on analog 

25 circuitry design and companies A and C dominate this segment. The smallest technology 
segment includes patents on semiconductor processes and companies B and C dominate 
this segment. 

Referring to FIG. 33, the business drivers of the same four major competitors A, B, C and 
30 D of FIG. 31 include better performance, increase battery life, increase functionality, 
increase versatility, meet the new market needs, reduce cost, reduce power consumption, 
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reduce size and weight. The main drivers are better performance, reduce power 
consumption, increase battery life and reduce cost. 

Referring to FIG. 34, the core competencies of the same four major competitors A, B, C 
5 and D of FIG. 31 include analog designer, device engineer, logic engineer, system 
engineer and process engineer. The main core competency for all four companies is the 
analog designer. 

Referring to FIG. 35, a timing analysis of the same four major competitors A, B, C and D 
10 of FIG. 31 indicates that the majority of the patents were issued in years 1998, 1999 and 
2000 and the majority of these patents were granted to company D. Company B owns 
most of the patents issued in the early years from 1973 to 1982. Company B continues 
to patent inventions in this field, but not to the same extend as companies A, C and D. 

15 Based on the analysis of the patent mapping results the IP space occupied by a company 
is mapped out in connection to the IP space occupied by other companies, the space 
occupied by its own products and the space occupied by other companies products. 
Referring to FIG. 36, the IP space of company A, represented by Venn diagram 802, 
intersects the space where company A has products, represented by Venn diagram 808, 

20 and is tangential to the IP spaces of company B IP space, represented by Venn diagram 
806. Company B IP space 806 intersects the space where company B has products, 
represented by Venn diagram 804 and the product space 804 is tangential to the IP space 
of company A 802. However, the product space of company A 808 intersects both the 
IP space 806 and product space 804 of company B. The common area 810 between 

25 company A product space 808 and company B IP space 806 is called the dark space and 
indicates a space where company B has patents and company A has products which may 
violate company B's patent space. Dark space 810 identifies licensing opportunities for 
company B. Area 812 where company A has products but neither company A or 
company B have patents is called the gray space. Grey space 812 identifies opportunities 

30 for patenting for both company A and company B. 
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Referring to FIG. 37, company B's product space 804 intersects company A's IP space 
802 and company B's IP space. The intersect space between company A' IP space and 
company B's product space minus the space occupied by company B's IP space defines a 
green space 814 where company A has IP and company B has products violating 
5 company A's IP space. The green space 814 identifies licensing opportunities for 
company A. 

Referring to FIG. 38, there is a fourth space area called white space, represented by Venn 
diagram 820 where no patents exist, but company A has a developed a capability, 
10 represented by Venn diagram 816 and there is a market need, represented by Venn 
diagram 818. White space 820 identifies new opportunities for patenting for company A. 

Referring to FIG. 39, an apparatus 850 for implementing the mapping process 100 
includes a computer system 851, connected to the Internet 862. The computer system 

15 851 includes a CPU unit 852, a screen 854, a keyboard 856, and a mouse 858. The CPU 
unit includes additional components of the computer system 851 (not shown) such as a 
processor, memory, a disk drive, a compact disc read only memory and an interface 
system for connecting to the Internet 862. The USPTO has a Webpage 864 accessible via 
the Internet 862. The USPTO Webpage 864 contains the searchable patent database 200. 

20 A search query is send from the computer system 851 to the USPTO database 200. The 
search results are received by the computer system 851 and entered in database 500 
housed in the computer system 851. The computer system 851 is part of a network (not 
shown) that includes other computer systems connected with each other and with 
computer system 851 so that database 500 can be shared and viewed simultaneously by 

25 several users. 

In other embodiments the searchable database 200 of FIG. 1 may be the European Patent 
Office (EPO) database, the Japanese Patent Office database or an international patent 
database. Database 200 may also contain publications, magazines, books, and websites. 

30 
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The many features and advantages of the present invention are apparent from the detailed 
specification, and, thus, it is intended by the appended claims to cover all such features 
and advantages of the described apparatus that follow the true spirit and scope of the 
invention. Furthermore, since numerous modifications and changes will readily occur to 
5 those of skill in the art, it is not desired to limit the invention to the exact construction 
and operation described herein. Moreover, the process and apparatus of the present 
invention, like related apparatus and processes used in medical applications tend to be 
complex in nature and are often best practiced by empirically determining the appropriate 
values of the operating parameters or by conducting computer simulations to arrive at a 
10 best design for a given application. Accordingly, other embodiments are within the scope 
of the following claims. 
What is claimed is: 

15 



20 



25 



30 
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1. A method of researching and analyzing information contained in a plurality of 
documents, wherein said documents belong to a first database and are organized 
according to a first set of fields for an electronic search and retrieval by a 
computer, the method comprising: 
5 conducting an electronic search of said first database to retrieve at least 

one document; 

developing a second set of fields; 

reading said at least one document to retrieve information pertaining to 
said second set of fields; 
10 entering said at least one document, values of said first set of fields for 

said at least one document, said second set of fields and said retrieved information 
pertaining to said second set of fields into a second database; and 

analyzing said information contained in said second database. 

15 2. The method of claim 1 wherein said first database is searched based on said first 
set of fields. 

3 . The method of claim 1 wherein said first database is searched based on keywords. 

20 4. The method of claim 1 wherein said first database is searched based on a Boolean 
phrase. 

5. The method of claim 1 wherein said first database is searched based on a "natural 
language" query. 

25 

6. The method of claim 1 wherein said first database is a patent database, said 
documents are patents and said first set of fields are patent fields. 

7. The method of claim 6 wherein said patent database is the United States Patent 
30 and Trademark Office patent database. 



23 



8. The method of claim 6 wherein said patent database is the European Patent Office 
patent database. 

9. The method of claim 6 wherein said patent database is the Japanese Patent Office 
5 patent database. 

10. The method of claim 6 wherein said patent database is an international patent 
database. 

10 11. The method of claim 6 wherein said patent fields are selected from a group 
consisting of Patent Number, Title, Assignee Name and Location, Filing date, 
Date of patent, Application Number, Inventor(s) Name, U.S. Class, U.S. Subclass, 
International Class, International Subclass, Field of Search, Number of references 
cited, Number of claims, Number of independent claims, Abstract, Name of 

15 Primary Examiner, Name of secondary Examiner or Name of attorney, agent, 

firm. 

12. The method of claim 1 wherein said documents are publications. 
20 13. The method of claim 1 wherein said documents are books. 

14. The method of claim 1 wherein said documents are newspapers. 

15. The method of claim 1 wherein said documents are magazines. 

25 

16. The method of claim 1 wherein said second set of fields are user-defined fields 
and are developed from answers to questions contained in a first input form. 

17. The method of claim 16 wherein said user-defined fields comprise a core 
30 competency of an invention. 
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18. The method of claim 16 wherein said user-defined fields comprise a technology 
of an invention. 

19. The method of claim 16 wherein said user defined fields comprise an element of 
5 an invention. 

20. The method of claim 16 wherein said user defined fields comprise a business 
driver for an invention. 

10 21. The method of claim 1 6 wherein said user defined fields comprise a product of an 
invention. 

22. The method of claim 16 wherein said user defined fields comprise a field of an 
invention. 

15 

23. The method of claim 16 wherein said user defined fields comprise a problem 
solved by an invention. 

24. The method of claim 16 wherein said user defined fields comprise an intellectual 
20 property strategy for an invention. 

25. The method of claim 16 wherein said user defined fields comprise a priority of an 
invention. 

25 26. The method of claim 16 wherein said user defined fields comprise a capability for 
producing an invention. 

27. The method of claim 16 wherein said user defined fields comprise a level of 
novelty of an invention. 

30 
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28. The method of claim 16 wherein said user defined fields comprise a market size 
for an invention. 

29. The method of claim 16 wherein said user defined fields comprise a maturity 
5 level of an invention. 

30. The method of claim 16 wherein said user-defined fields comprise a level of 
importance of an invention. 

10 31. The method of claim 16 wherein said user-defined fields comprise a patent 

strategy of an invention. 

32. The method of claim 16 wherein said user-defined fields comprise a business 
strategy of an invention. 

15 

33. The method of claim 16 wherein said user-defined fields comprise an assessment 
of the ability to detect the use of an invention by anyone other than an owner of 
the invention. 

20 34. The method of claim 16 wherein said user-defined fields comprise an estimate of 
use of an invention by competitors. 

35. The method of claim 16 wherein said user-defined fields comprise an estimate of 
use of an invention by suppliers. 

25 

36. The method of claim 16 wherein said user-defined fields comprise an estimate of 
use of an invention by customers. 

37. The method of claim 16 wherein said user-defined fields comprise an estimate of 
30 the alliance potential generated by an invention between an owner of the 

invention and others. 
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38. The method of claim 16 wherein said user-defined fields comprise an estimate of 
technology transfer potential generated by an invention between an owner of the 
invention and others. 

5 

39. The method of claim 16 wherein said user-defined fields comprise a level of 
prestige generated by an invention. 

40. The method of claim 16 wherein said user-defined fields comprise an estimate of 
1 0 licensing potential of an invention. 

41. The method of claim 16 wherein said user-defined fields comprise an estimate of 
market potential of an invention. 

15 42. The method of claim 1 wherein said second database comprises an Access™ type 

database. 

43. The method of claim 1 wherein said analyzing of the information contained in 
said second database comprises: 

20 entering the information into a spreadsheet; and 

displaying said user-defined fields in graphs and tables. 

44. The method of claim 1 wherein each said second set of fields is associated with a 
value and said analyzing comprises summation of said values for each field. 

25 

45. The method of claim 1 wherein more than one documents are retrieved. 



46. The method of claim 45 further comprising developing a high level of abstraction 
view of said retrieved documents. 

30 
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47. An apparatus for researching and analyzing information contained in a plurality 
of documents comprising: 

a computer system including a monitor and a central processing unit 
(CPU), said computer system accessing an Internet website; 
5 a first database contained within said Internet website and wherein said 

database contains a plurality of documents organized according to a first set of 
fields for an electronic search and retrieval by said computer system, 

computer instructions for conducting an electronic search of said first 
database to retrieve at least one document; 
1 0 computer instructions for developing a second set of fields; 

computer instructions for entering into a second database said at least one 
document, values of said first set of fields for said at least one document, said 
second set of fields and information pertaining to said second set of fields 
extracted via reading from said at least one document; and 
15 computer instructions for analyzing information contained in said second 

database. 

48. The apparatus of claim 47 wherein said second database is contained within said 
computer system. 

20 
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METHOD OF RESEARCHING AND ANALYZING INFORMATION 
CONTAINED IN A DATABASE 

5 

Abstract of the Disclosure 

This invention describes a method of researching and analyzing information 
contained in documents that belong to a first database and are organized according to first 
set of fields for an electronic search and retrieval by a computer. The method includes 

10 the steps of conducting an electronic search of the first database to retrieve at least one 
document, developing user-defined fields, reading the at least one document to retrieve 
information pertaining to the user defined fields, entering the at least one document, 
values of the first set of fields for the at least one document, the user-defined fields and 
the retrieved information pertaining to the user defined fields into a second database and 

15 analyzing the information contained in the second database. An example of a first 
database is the USPTO patent database and the documents are patents. In this case a user 
fills out input forms describing and capturing specific interests, which are used to develop 
user-defined fields. Examples of such user-defined fields include among others core 
competencies, technologies, elements, business drivers, products, fields of the invention, 

20 problems solved, intellectual property strategy, priority, capabilities, level of novelty, 
market size, maturity of technology, patent strategy, business strategy, importance of 
products, detectability of the invention, use of the invention by competitors, suppliers, 
and customers, alliance potential or technology transfer potential and prestige. 

25 
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Step One: Main Categories 560 

To create the HLA framework, the products or technology must first be separated into 
general categories. For example, a computer system might be separated into the 
general categories of the processor, the monitor and input devices. 




1 




Step Two: Sub-Category 1 562 

Next, separate the general categories into sub-categories. For example, the processor 
might be separated into the motherboard, the graphics board and the disk drives. 




+ 




Step Three: Sub-Category 2 564 

Next, if possible, separate each sub-category 1 entry into another level of sub- 
categories. For example, the motherboard might be separated into the CPU and the 
cache. 








Step Four: Add to Spreadsheet 566 

Finally, add this information to the spreadsheet by double clicking the icon below. 
Place the main categories in the "HLA - Main" column, the first sub-categories in the 
"Sub-Category 1" column, and the second sub-categories in the "Sub-Category 2" 
column. 
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DECLARATION AND POWER OF ATTORNEY FOR PATENT APPLICATION 



As a below-named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name; I 
believe I am an original, first and joint inventor of the subject matter claimed in the 
present application and for which a patent is sought on the invention entitled: 



METHOD OF RESEARCHING AND ANALYZING INFORMATION CONTAINED 
IN A DATABASE . . 

the specification of which (check one): 

_X_ is attached hereto; 

was file on , as United States Application Serial No. • 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claims and the drawings, and including any amendments 
referred to herein. 

Duty of Disclosure and Truthfulness of Information Provided 

I acknowledge the duty to disclose information that is material to the patentability of the 
above-identified invention of the present application in accordance with Title 37, Code of 
Federal Regulations, section 1.56. 

I hereby declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, Title 18 of the United States 
Code, section 1001, and that such willful false statements may jeopardize the validity of 
this application or any patent issuing thereon. 
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Express Mafl No. f=U KXP^W ^OS 



Attorney Docket No. iPCG-'0^3 



Foreign Priority 

I hereby claim foreign priority under Title 35, United States Code, section 119(a)-(d) of 
any foreign application(s) for patent or inventor's certificate listed below and have also 
identified below any foreign application for patent or inventor's certificate have a filing 
date before that of the application for which priority is claimed: 

Prior Foreign Applications): 

Number Country Day/Month/Year Priority Claimed (Yes/No) 



U.S. Provisional Application Priority 

I hereby claim benefit under Title 35, United States Code, section 119(e) of any United 
States provisional patent applications listed below: 

Provisional Application Serial No. Filing Date 



U.S. Non-Provisional Application Priority 

I hereby claim the benefit under Title 35, United States Code, section 120, of any United 
States application(s) listed below and, insofar as the subject matter of each of the claims 
of this application is not disclosed in the prior United States application in the manner 
provided by the first paragraph of Title 35, United States Code, section 112, I 
acknowledge the duty to disclose information material to the patentability of the above- 
identified invention as defined in Title 37, Code of Federal Regulations, section 1.56, 
which occurred between the filing date of the prior application and the national or PCT 
international filing date of this application: 

Prior U.S. Applications: 

Serial No. Filing Date Status (patented, pending, abandoned) 
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Power of Attorney 

As a named inventor, I hereby appoint the following attorneys and/or agents to prosecute 
this application and to transact all business in the United States Patent and Trademark 
Office connected therewith: 

Aliki K. Collins, Ph.D., Reg.Nr.: 43558 

Send all correspondence to: 
Attention: Aliki K. Collins, Ph.D. 
IPCapitalGroup, Inc 
400 Cornerstone Drive, Suite 325 
Williston, VT 05495 
Tel: 802-872-3200 
FAX: 802-288-9468 



Direct all telephone calls to: Aliki K. Collins, Ph.D., Reg.Nr.: 43558, 
Tel: 802-872-3200, X 279 



(continue to next page) 
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INVENTORS 



Full Name of Inventor: John Edward Cronin 

Signature of Inventor: ^^\aJ£ (Lflf*~- '^g^ c37 , 

Post Office Address: 130 Sunset Circle, Milton, VT 05468 
Citizenship: United States of America 



Full Name of Inventor: YuWangBibby 
Signature of Inventor: /^/J^ x ^t^cv.^ ^ ($eI/L, 



a 




Post Office Address: 275 French Hill Road, St Albans, VT«05478 
Citizenship: United States of America 
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